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420 aa 



C20orfl3 420 aa 

Threonine aspartase 1, an endopeptidase containing an active site threonine that hydrolyzes substrates 
following aspartate, proteolytically cleaves and activates the MLL transcription factor which allows for 
proper homeobox gene expression 



Match: Length=420, Identity: 99%, Similarity:99%, Query Overlap: 100%, Subject Overlap: 100%, E- 
value:0.0, Score:838 



Query: 1 MTMEKGMSSGEGLPSRSSQVSAGKITAKELETKQSYKEKRGGFVLVHAGAGYHSESKAKE 60 

MTMEKGMSSGEGLPSRSSQVSAGKITAKELETKQSYKEKRGGFVLVHAGAGYHSESKAKE 

Sbjct: 1 MTMEKGMSSGEGLPSRSSQVSAGKITAKELETKQSYKEKRGGFVLVHAGAGYHSESKAKE 60 

Query: 61 YKHVCKRACQKAIEKLQAGALATDAVTAALVELEDSPFTNAGMGSNLNLLGEIECDASIM 120 

YKHVCKRACQKAI EKLQAGALATDAVTAALVELEDS PFTNAGMGSNLNLLGE I ECDAS IM 

Sbjct: 61 YKHVCKRACQKAI EKLQAGALATDAVTAALVELEDS PFTNAGMGSNLNLLGE I EC DAS I M 120 

Query: 121 DGKSLNFGAVGALSGIKNPVSVANRLLCEGQKGKLSAGRIPPCFLVGEGAYRWAVDHGIP 180 

DGKSLNFGAVGALSGIKNPVSVANRLLCEGQKGKLSAGRIPPCFLVGEGAYRWAVDHGIP 

Sbjct: 121 DGKSLNFGAVGALSGIKNPVSVANRLLCEGQKGKLSAGRIPPCFLVGEGAYRWAVDHGIP 180 

Query: 181 SCPPNIMTTRFSLAAFKRNKRKLELAERVDTDFMQLKKRRQSSEKENDSGTLDTVGAVVV 240 

SCPPNIMTTRFSLAAFKRNKRKLELAERVDTDFMQLKKRRQSSEKENDSGTLDTVGAVVV 

Sbjct: 181 SCPPNIMTTRFSLAAFKRNKRKLELAERVDTDFMQLKKRRQSSEKENDSGTLDTVGAVVV 240 

Query: 241 DHEGNVAAAVSSGGLALKHPGRVGQAALYGCGCWAENTGAHNPYSTAVSTSGCGEHLVRT 300 

DHEGNVAAAVSSGGLALKHPGRVGQAALYGCGCWAENTGAHNPYSTAVSTSGCGEHLVRT 

Sbjct: 241 DHEGNVAAAVSSGGLALKHPGRVGQAALYGCGCWAENTGAHNPYSTAVSTSGCGEHLVRT 300 

Query: 301 ILARECSHALQAEDAHQALLETMQNKFISSPFLASEDGVLGGVIVLRSCRCSAEPDSSQN 360 

ILARECSHALQAEDAHQALLETMQNKFISSPFLASEDGVLGGVIVLRSCRCSAEPD SQN 

Sbjct: 301 ILARECSHALQAEDAHQALLETMQNKFISSPFLASEDGVLGGVIVLRSCRCSAEPDFSQN 360 

Query: 361 KQTLLVEFLWSHTTESMCVGYMSAQDGKAKTHISRLPPGAVAGQSVAIEGGVCRLESPVN 420 

KQTLLVEFLWSHTTESMCVGYMSAQDGKAKTHISRLPPGAVAGQSVAIEGGVCRLESPVN 

Sbjct: 361 KQTLLVEFLWSHTTESMCVGYMSAQDGKAKTHISRLPPGAVAGQSVAIEGGVCRLESPVN 420 
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BKL Annotation details of one docket matching protein 

Human C20orfl3 Threonine aspartase 1, an endopeptidase containing an active site 
threonine that hydrolyzes substrates following aspartate, proteolytically cleaves and activates the 
MLL transcription factor which allows for proper homeobox gene expression 



Disease 
Phenotype 

Title line TitleLine Phrases: 

phrases Pfam Title Here], has moderate similarity to human ASRGL1, which [...]member 

of the asparaginase family, has low similarity to aspartylglucosaminidase (human 
AGA), which catalyzes cleavage of amide bonds between asparagine and 
oligosaccharide chains in N-linked glycoproteins and is associated with ( No ref.) 
Name Phrase: 

Threonine aspartase 1 2003 (14636557^ 
Molecular/Biochemical Function: 

contains an active site threonine and hydrolyzes peptide substrates following 
aspartate residues 2003 (14636557 ) 

likely undergoes autoprocessing to generate 22 kDa and 28 kDa subunits which 
reassemble to form an active protease 2 00 3 (14636557) 

proteolytically cleaves and activates the MLL transcription factor, which allows 
for proper homeobox gene expression 2003 ( 14636557) 

Synonyms FLJ20212 
dJ585I14.2 



Cognate 
members 



Mouse 4930485D02Rik 



GO 



GO ontology: 
GO ontology: 
GO ontology: 
GO ontologv: 

(14636557)* 

GO ontology: 
GO ontology: 
GO ontology: 



endopeptidase activity Experimental (E) 2003 ( 14636557) 
membrane fraction Experimental (E) 2003 (14636557) 
proteolysis and peptidolysis Experimental (E) 2003 ( 14636557) 
regulation of transcription from Pol II promoter Experimental (E) 2003 



protein autoprocessing Predicted (P) 2003 (14636557) 
segment specification Predicted (P) 2003 (14636557) 
protein processing Experimental (E) 2003 (14636557) 
Protein modifications (fungi, worm): protein autoprocessing Predicted (P) 2003 
(14636557) 

Protein modifications (fungi, worm): endoproteolysis Experimental (E) 2003 
(1 46 3655 7) 



Expression Body: cervix * Tumors: carcinoma * Cell origin: cell line * Techniques: protein 
activity detected Experimental (E) 2003 (J 4636557^ 

Body: kidney * developmental stage: embryo-fetus * Cell origin: cell line * 
Techniques: protein purification Experimental (E) 2003 ( 14636557) 
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for Cleavage of MLL 

and Proper HOX Gene Expression 
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Summary 



The Mbc&d-Uneago Leukemia gene WLUHRX/ALL1) 
encodes a large nuclear protein homologous to Dro- 
sophila trtthorax that Is required for the maintenance 
of HOX gene expression. MLL is cleaved at two con- 
served sites generating N320 and C180 fragments, 
which heterodimerize to stabilize the complex and 
confer its subnuclear destination. Here, we purify and 
clone the protease responsible for cleaving MLL. We 
entitle it Taspasel as It initiates a class of endopepti- 
dases that utilize an N-terminal threonine as the active 
site nucleophlle to proteolyze polypeptide substrates 
following aspartate. Taspasel proenzyme is Intra- 
molecularty proteotyzed generating an active 28 kDa 
a/22 kDa p heterodimer. RNAI-medfated knockdown 
of Taspasel results in the appearance of unprocessed 
MLL and the loss of proper HOX gene expression. 
Taspasel coevolved with MLUtrfthorax as Arthropods 
and Chordata emerged from Metazoa suggesting that 
Taspasel originated to regulate complex segmental 
body plans in higher organisms. 

Introduction 

MLL/HRX/ALL 1 encodes a 3969 amino acid nuclear pro- 
tein bearing multiple conserved domains with assigned 
activities Including an N terminus with three AT-hook 
motifs that bind AT rich DNA segments (Zeleznik-Le et 
al., 1994), a DNA methyl transferase homology domain 
that represses transcription (Xfa et al., 2003), four PHD 
fingers that mediate protein-protein Interactions (Fair et 
al„ 2001), a transactivation domain that interacts with 
CBP (Ernst et al., 2001), and a C-termlnal SET domain 
with histone H3 lysine 4 methyl transferase activity 
(Mflne et al., 2002; Nakamura et al., 2002) (Figure 1A). 
MLL and its Drosophila homolog trithorax are required 
for maintaining proper Hex and homeotlc gene expres- 
sion patterns, respectively (Breen and Harte, 1993; Yu 
etal., 1998). 

Chromosome translocations characteristically found 
in human Infant leukemia disrupt MLL (1 1 q23), generat- 
ing chimeric proteins between the MLL N temilnus and 
multiple translocation partners that vary substantially 
(Ayton and Cleary, 2001; Domer et al., 1993; Downing 
and Look, 1996; Gu et al., 1992; TWrman et al., 1993; 
Tkachuk et al., 1992). Mice carrying engineered Mil 
translocations develop leukemia (Corral et al., 1 996; For- 
ster et al. f 2003). Gene expression profiles of infant leu- 
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kemias bearing MLL translocations identified a charac- 
teristic gene expression profile that distinguishes this 
poor prognosis leukemia from other leukemlas (Arm- 
strong et aL, 2002; Yeoh et al 2002). Among the upregu- 
lated genes were some recognized targets of MLL in- 
cluding select HOX genes. Deregulated expression of 
HOX genes typifies certain malignancies (Buske and 
Humphries, 2000; Cillo et al., 2001; Dash and Gilliland, 
2001). 

Recently, we and others demonstrated that MLL is 
normally processed at two cleavage sites CS1 (D/GADD) 
and CS2 (D/GVDD) and that mutation of both sites abol- 
ishes the proteolysis (Hsieh et al., 2003; Yokoyama et 
al., 2002) (Figure 1 B). The sequence of the cleavage 
site is highly conserved in MLL homofogs from flies 
to mammals. MLL cleavage generates N-terminal p320 
(N320) and C-terminal p160 (C180) fragments, which 
heterodimerize to form a stable complex that localizes to 
a subnuclear compartment. The FYRN domain of N320 
directly interacts with the FYRC and SET domains of 
CI 80. This dynamic postcleavage association confers 
stability to N320 and correct nuclear sublocalization of 
the MLL complex for proper target gene expression 
(Hsieh et al., 2003). 

Site-specific proteolysis is essential In many impor- 
tant biological pathways including the sequential activa- 
tion of blood coagulation factors (Furie and Furle, 1992), 
cholesterol-gauged liberation of SREBP from the ER 
(Brown et al., 2000), llgand -activated cleavage and sub- 
sequent release of the intracellular domain of Notch 
(Brown et al., 2000), maturation of the hedgehog-signal- 
ing molecule (Ye and Fortini, 2000), separation of HCF-1 
for proper cell cycle regulation (Wilson et al., 1995), and 
activation of caspases and their subsequent cleavage 
of death substrates during apoptosls fjhomberry and 
Lazebnlk, 1998). Identification and characterization of 
the responsible proteases has not only proven critical 
to understanding such biologic processes but also for 
developing targeted therapeutics for diseases involving 
specific pathways. To better understand the regulation 
of MLL cleavage and hence its effects on target gene 
expression, we purified the protease responsible for 
MLL cleavage. 

Results 

Site Specific MLL Cleavage Substrate 
Inhibitors to the known classes of proteases showed no 
substantial inhibition of MLL processing within whole 
cells, providing the first suggestion that this cleavage 
activity would prove unique. To facilitate the character- 
ization of the MLL protease, we generated a tractable 
cleavage substrate. We found that aa 2400-2900 of MLL 
containing CS1 (aa 2666) and CS2 (aa 271 8) are sufficient 
to recapitulate endogenous MLL cleavage (Figure 1C). 
Proteolysis of this p75 MLL test substrate at CS1 or 
CS2 would generate N-terminal Myc-tagged p42 or p47 
fragments and C-terminal Rag-tagged p33 or p28, re- 
spectively (Figure 1 A). The p47 and p28 fragments were 



Celt 
294 



n . IMT Bromo CS1 CS2 
DNMT PH04 



FYRC SET 



AT hook 




3969 



hMLL 



B 



CS1 

mm 



mm*. 



C D L 

L fi Q ^. 

A A K M R {■ 



CS2 



:b,G,V.;0/p: 



6- S 

i c l '5 

. ™S I T E 



rr 



D< A 
6 T • 
F R 
A 



p28(CS2 Cleavage) 
P33(CS1 Cleavage) 
=°*fp28 (CS1+2 Cleavage) 



Human MLL 
Mouse MLL 
Puffer fi ah MLL 

Human MLL 
Mouse MLL 
Puffer fish MLL 
Anopheles trithorax 
Drosophila trithorax 
Human MLL 2 




-p75 

. P47 
•p42 




• p75 



. p28 



AntNMyc Western 



Anti-Flag Western 



figure 1. A Cleavage-Site Reporter for MLL Proteolysis 
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Figure 2L In Vitro Reconstitute of MLL Oteavage Using Subcellular Fractions 

(B) None of the JKS^^ Lower level aotovfty afeo exists In the SI 00 cytosollc fraction, 

translated p75 SC ^^^^^ *° "J ^Conine-labeled In vftro transcribed/ 
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metalfoproteases, acid proteases, and the 26S protea- 
some, were examined but again showed no substantial 
Inhibition of MLL proteolysis In this fraction enriched for 
endoplasmic reticulum {Figure 2B). Only heat Incubation 
at 70°C for 30 mln Inactivated the proteolytic activity In 
light membranes. 

Purification 

The LM fraction possessing the protease activity was 
subjected to serial column chromatography and the ac- 



tivity followed by an In vitro cleavage assay. Seven chro- 
matographic steps achieved an approximately 200,000- 
fold enrichment of the proteolytic activity (Figure 3A). 
The ultimate mono S fractions displaying the highest 
enzymatic activity were subjected to SDS-PAGE fol- 
lowed by a silver stain (Figure 3B). Polypeptide bands 
whose presence best correlated with the proteolytic ac- 
tivity were digested with trypsin and subjected to liquid 
chromatography and tandem mass spectrometry (LC- 
MS/MS) sequence analysis. Two peptide sequences 



08/26/2004 10:28 FAX 301 838 0208 



ffl 



TIGR 



@1005 



i 

I 
I 
f: 
II 

|i 



j. 

! | . 



Cell 
286 




B 



0.1 M 



. a 0.75 MKCf 
0.5 M 

0.2 M Hydroxyapatite 

0.5 MK Phosphate MkDa- 

0.1 M " j 
0.01 M Q Sepharose 



MoiioS 

jperJex 200 
I 

Mono Q 
Mono 8 



20kDaH 
14kDa 




*SLNFGAVGALSGIK 
*NPVSVANR 



41 



391 



'420 



Glyoosylasparagfnase 



L-Asparaginase 




a y y v Homo sapiens 
A V Y :;V Mua mus cul u s 
A V V V Rattus norvegicus 
^ A. V y v Fugu rubripes 
T ;.V 'S'A V V V Danio rerio 



A V. C 



Anopheles gambiae 
Drosophila melanogastex 



M/jV.- V .z« Homo sapiens 
M V V -I Mua musculua 
M i .ft '••J . Fugu rubripea 
M.V A V Spodoptera frugiperda 
M. I A L Flavobacterlum meningosepticum 

A V A L Homo sapiens 
A'V A L Mua musculus 
A, V.. A V. Fugu rubripes 
C V ;A ; y: Arabidopsis thaliana 
C IV A V Lupinus arboreus 



Taspasel 



Flag 
=1*420 



o28 



p22 




a £ s p 



Proenzyme 



a26 




Proenzyme 



Antl-Myc Western 



Anti-Flag Western 



Figure 3. Purification and Cloning of the MIX Cleaving Protease, Taspasel 
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corresponding to aa 124-137 and aa 138-146 of an un- 
charactedzed open reading frame (orl) present on chro- 
mosome 20 (c20orfl3) were Identified from a gel slice 



that migrated at ~28 kDa (Figure 3B). The orf predicts a 
420 aa protein possessing an Asparaginase 2 homology 
domain (pfam 01112, interpro 000246) from aa 41-391 
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Figure 4. Cleavage of MLL Substrate by Re- 
combinant Taspasel 

(A) Purified recombinant Taspasel (rTas- 
pasel) at 100 ng processes ^-methionine- 
labeled p75 wt MLL substrate (left Image) but 
not p75 CS1/2 mt substrate (right image). 
Taspasel processed p75 more completely 
than trie final mono S fraction utilized in Fig- 
ure 3B (lane 2 and 3, (eft image). 

(B) Taspasel sequential cleavage of CS1 and 
CS2. ''S-methionine labeled p7S wt MLL sub- 
strate was Incubated with 1 ng or 10 ng rTas- 
pasel at a final volume of 25 ^ and samples 
were analyzed over time by SOS-PAGE and 
autoradiography for cleavage. The more con- 
served CS2 was cleaved (p47 fragment) be- 
fore CS1 (p47 fragment) at 1 ng rTaspasel 
(upper Image). 10-fold more rTaspasel (10 
ng) Increased C81 processing (lower image). 



(Figure 3C). Typical proteins that contain this signature 
motif include L-asparaginase and glycosyfasparaginase 
(Figure 3C), L-asparaginase catalyzes an amide bond 
hydrolysis to convert L-asparagine to L-aspartate. Gly- 
cosyiasparaglnase is an amidohydrolase, which cata- 
lyzes the N-acetylglucosamlne-asparagine bond that 
links oligosaccharides to asparaglne. However, no en- 
dopeptidase activity had been demonstrated to date 
among the Asparaginase^ family enzymes. The charac- 
teristics of this MLLcleavfng protease (c20orfl 3) as sub- 
sequently detailed here prompt its designation as Tas- 
pasel (Threonine aspartasel). Sequence alignment 
searches of the public databases identified highly con- 
served Taspasel orthologs In fly, mosquito, pufferfish, 
zebrafish, rat, mouse, and human (Figure 3C, Supple- 
mental Figure S1 available at http^/www.cell.com/cgi/ 
content/full/115/3/293/DC1), but not in the nematode, 
C. etegans. The conserved LDTVG motif that surrounds 
a putative threonine active site is distinct from L-aspara- 
ginase and glycosytasparaginase (Figure 3C) suggesting 
it may have a unique specificity. 

Recombinant Taspasel Activity 
To assess the potential activity of this candidate prote- 
ase, we expressed and purified recombinant N-terminal 
Hls-tagged Taspasel from E. coU. Recombinant Tas- 
pasel (rTaspasel) cleaved the wt p75 MLL but not the 
CS1/2 mt substrate (Figure 4A). rTaspasel more effi- 
ciently processed p75 MLL to completion based on the 
predominance of the p42 fragment when compared to 
the activity In the original mono S fraction (Figure 4A). 



The p42 fragment results from cleavage at the CS1 
(D/GADD) site, which Is less conserved than CS2 (D/GVDD) 
and is also less efficiently processed in vivo (Hsleh et 
al., 2003). We next examined the differential sensitivity 
of CS1 and CS2 sites for cleavage by rTaspasel . The 
p47 fragment appears first, while higher enzyme con- 
centration or more time is needed for the appearance 
of the p42 fragment indicating a preference of rTaspasel 
for CS2 over CS1 (Figure 4B). Thus, the cloned Tas- 
pasel 's proteolytic characteristics match the pattern of 
endogenous MLL proteolysis. 

The First Threonine 4spartase, Taspasel 
Purification of recombinant N-terminal Hls-tagged Tas- 
pasel yielded an expected 50 kDa product, a His-tagged 
28 kDa a subunit, and a copurified 22 kDa polypeptide 
(Figure 5A). This 22 kDa polypeptide was subjected to 
N-terminal Edman degradation analysis, which identi- 
fied threonine 234 of Taspasel as the N-terminal amino 
acid of the apparent 22 kDa ^ subunit (Figure 5A). This 
represents proteolysis between aspartate 233 and thre- 
onine 234 of the 50 kDa putative proenzyme. Similarly 
when an N-terminal and C-terminal epitope tagged hu- 
man Taspasel cDNA was expressed in the human 293 
T cell line, the 50 kDa product aJso appeared to be 
processed to a 28 kDa N-terminal a subunit and a 22 
kDa C-termlnal p subunit (Figure 3D). This suggested 
that Taspasel may be Intramolecularly proteolyzed and 
processed subunlts reassembled through a noncovalent 
association. Amino acid substitution of either aspartate 
233 (D233A) or threonine 234 (T234A) to alanine each 
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Figure 5. Taspasel Proenzyme Undergoes 
Intramof ocular Processing to an Active 28 
kDa a/22 kDa 0 Heterodlmer 

(A) Cobalt column purification of bacterial- 
expressed, N-terminal His-tagged wt and 
mutant Taspasel . A 22 kDa fragment copu ri- 
fled with the His-tagged 28 kDa a subunlt of 

. Taspasel (lane 1). Edman degradation of the 
22 kDa fragment revealed a 14 amino acid 
sequence of Taspasel beginning at threonine 
234 (asterisk), which proved to be the 22 kDa 
B subunlt that heterodlmerizes with Hb-a28. 
Anti-Hls tag Ab recognized both the 50 kDa 
His-proenzyme and the processed His-a28 
but not the processed 022 of rTaspasel (data 
not shown). Alanine substitutions of aspar- 
tate 233 (D233A) or threonine 234 (T234A) 
abolish the Intramolecular proteolysis of Tas- 
pasel (lanes 2 and 3). 

(B) Purified rTaspasel In a dose-response 
assay cleaves *S-methlonine-labeled p75 wt 
MLL substrate. The D233A mutant rTaspasel 
{middle image) demonstrated an M, 000-fold 
decrease In MLL proteolysis. The T234A mu- 
tant rTaspasel demonstrated no enzymatic 
activity consistent with T234 being the active 
she nucleophlle Cower Image). 



abolished the intramolecular processing of Taspasel 
expressed In £ coll (Figure 5A) or in mammalian cells 
(Figure 3D). However, the D233A mutant retained some 
residual enzymatic activity, although it was M 000-fold 
less efficient than the wt enzyme (Figure 5B, middle 
Image). Conversely, threonine 234 which became the 
N terminus of the p subunit is absolutely essential for 
cleavage activity (Figure 5B, lower image). These enzy- 
matic characteristics are similar to properties shared by 
L-asparaglnase and glycosylasparaginase which also 
demonstrate autoproteolysis of a proenzyme Into an 
active a/0 heterodimeric enzyme in which the N-terminal 
threonine of the p subunit is the active site nucleophiJe 
for catalysis (Guan et aL, 1 996; Uu et al., 1 998; Tlkkanen 
et al., 1996; Xu et aL, 1 999). Thus, to our knowledge, this 
MLL cleaving protease is the only known endopeptldase 
within the Asparaginase^ family with the characteristic 
of being a threonine aspartase, Taspasel. 

Proteolysis of MLL In Vivo Requires Taspasel 
We next asked whether Taspasel was required to cleave 
MLL within mammalian cells. As a model system to test 
specificity, we coexpressed the p75 MLL substrate re- 
porter together with Taspasel in 293T cells. Taspasel 
resulted In cleavage of wt p75 MLL but not the p75 CS1/2 
mt reporter (Figure 6A). Wt Taspasel, but not the T234A 
mutant Taspasel , enhanced the processing of p75 MLL 
to the final p42 product (Figures 1A and 6A). Only wt 



Taspasel , but not the T234A mutant of the nucleophile 
site demonstrated intramolecular processing Into a/p 
fragments (Figures 3D and 6A). To assess the role of 
endogenous Taspasel, we designed duplex RNAi 
against Taspasel , which knocked down the expression 
of endogenous Taspasel by ~80% In HeLa cells (Fig- 
ures 6B and 6C). This reduction of endogenous Tas- 
pasel severely impaired the cleavage of transacted p75 
wt MLL reporter as Judged by the marked decrease in 
the processed p47 fragment (11 ±3% of control RNAI; 
n = 3 experiments; p < 0.005) and the disappearance 
of the p42 fragment (Figure 6B). Taspasel RNAI knock- 
down also resulted in a ~50% decrease In the endoge- 
nous, processed CI 80 MLL fragment as well as an in- 
creased abundance of full-length p500 MLL (4.3-fold 
increase over control RNAi; n = 3 experiments; p < 
0.005) (Figure 6C). In contrast, MLL RNAi resulted in the 
marked reduction of the CI 80 MLL fragment, but did 
not Increase p500 MLL (Figure 6C). In total, both the in 
vitro and in vivo cleavage assays confirm the role of 
Taspasel in the proper processing of MLL 

Taspasel Is Required for Proper HOX 
Gene Expression 

Genetic studies In both mice and flies establish that Mil 
and trithorax regulate Hox and homeotic gene expres- 
sion, respectively (Mazo et aL, 1990; Yu et al., 1995). 
Mice heterozygous for an Mll-exon^ disruption dem- 
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Figure 6. Taspasel Is Required tor Endogenous MLL Cleavage and Proper HOX Gene Regulation 

(A) Constructs expressing wt or mt p75 MLL substrate (as In Figure 1C) were cotransfected with wt or mt (T234A) Taspasel bearing an 
N-termlnal proteln-C tag into 293T cells. The N-termlnal cleavage products of p76 MLL were detected by an anti-Myc Western Wot 
cleavage of wt p75 MLL substrate by endogenous protease Is shown in lane 1 , upper Image. Transfectlon of wt Taspasel enhanced complete 
processing of p75 MLL processing, Judged by the decrease of both p75 and p47, and the Increase of the p42 fragment Canes 2, upper Image), 
No enhancement was observed with transacted T234A mutant (lane 3, upper Image). The Intramolecular proteolysis of expressed wt but not 
the T234A mutant Taspasel is demonstrated by antlprotein C Western blot (lower Image). 

(S) RNAI-medlated knockdown of endogenous Taspasel Impairs the processing of p75 wt MLL Hela cells were treated with indicated duplex 
RNAI for 48 hr before Introduction of the P 75 wt MLL expression construct and analysis of cleavage 24 hr later. Taspasel knockdown leads 
to a marked decrease In the p47 and the disappearance of the p42 fragment (lane 2). 

(C) RNAi-mediated knockdown of endogenous Taspasel results In Increased abundance of unprocessed p500 Ma and diminished Processed 
C180 MLL (lane 2). The Indicated duplex RNAi was Introduced Into HeLa cells and the expression of Taspaset a28, p500 MLL, CI 80 MLL, 
and actin were detected by the respective Abs. MLL RNAI markedly reduced the expression C180 MLL (lane 3). Control duplex RNAi had no 

(D) ^ter^d HOX A gene expression In Taspasel knockdown cells. Quantitative RT-PCR analysis of Indicated gene expression of RNAI-treated 
HeLa cells Is shown as a bar graph. The expression of Individual genes Is normalized to GAPDH levels for Individual samples. The % 
expression of an Indicated gene is calculated In comparison to a control duplex RNAi set at 100%. Values shown are mean ± 1 SD of four 
Independent experiments. 
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onstrate haploinsufficiency with bidirectional homeotic 
transformations and shifted anterior boundaries of sev- 
eral Hox genes (Yu etaf., 1995), Mir'- deficient embryos 
and mouse embryonic fibroblasts (MEFs) demonstrate 
Mil is required for the maintenance of selected Hox gene 
expression (Hanson et al., 1999; Yu et ah, 1995, 1998). 
As a first assessment of whether reduced Taspasel 
activity would alter gene expression, we examined the 
gene expression profile of HeLa cells treated with the 
Taspasel RN Ai versus a control RNAI (Figure 6D). Initial 
analysis of Affymetrix (HG-U133A) oligonucleotide array- 
based RNA profiles indicated diminished expression of 
selected HOX genes (data not shown). Consequently, 
we used a quantitative RT-PCR approach to determine 
the relative expression of genes across the HOX A clus- 
ter. Of note, the knockdown of Taspasel diminished the 
expression of the 3' located and "earlier" expressed 
genes In the HOX A cluster (At, A3, and A4), but not 
those genes located more 5' and expressed "later" dur- 
ing embryonic development (A5, A9 t and AfO) (Figure 
6D). This selected attenuation contrasts with the global 
decrease in expression of most HOX A genes (Af to 
A10) in cells with MLL knocked down (Figure 6D). These 
data support the importance of Taspasel In the correct 
expression of the early HOX A gene cluster (equivalent to 
the ANT-C cluster of DrosophJla) (Figure 7). 

Discussion 

We utilized highly conserved cleavage sequences within 
MLL to purify and clone a novel protease responsible 
for the routine proteolysis of MLL. Taspasel (threonine 
aspartame 1) is an endopeptldase within a family of en- 
zymes possessing an Asparaginase^ homology do- 
main. Other members present in both prokaryotes and 
eukaryotes Include the amidohydrotases, L- asparagi- 
nase in intermediary amino acid metabolism and gry co- 
sy! asparaginase. Glycosylasparaginase participates in 



the ordered degradation of N-llnked glycoproteins by 
cleaving Asn-GlcNAc linkages that Join oligosaccha- 
rides to proteins. Taspasel -mediated cleavage of MLL 
follows distinct aspartate residues suggesting Taspasel 
evolved from hydrolyzing asparaglne and glycosylaspara- 
gine to recognize a conserved peptide motif with an 
aspartate at the P1 position. Taspasel also belongs to 
a larger superfamily of diverse enzymes that are Ntn- 
hydrolases (rV-ferminal micleophiles) (Brannigan et a!., 
1 995). All members of this family are activated autocata- 
rytlcally, displaying an N terminally located catalytic nu- 
cleophile. The nascent single polypeptide is cleaved Into 
nonidentical a and p subunits, which assemble into a 
catalyticaJry active appa heterodimeric structure (Oino- 
nen and Rouvinen, 2000). Glycosylasparaginase is a 
well-characterized prototype that serf-catalyses a pep- 
tide bond rearrangement Of note, the threonine that 
provides the nucleophile for both autoproteolysis and 
hydrolase activity as well as the Immediately upstream 
aspartate residue are conserved in Taspasel . The threo- 
nine nucleophile of glycosylasparaginase appears to at- 
tack this immediately upstream amide bond through an 
N->0 acyl shift to catalyze this peptide bond cleavage 
(Xu et al., 1999). In the Ntn-hydrolase superfamily the 
nucleophile always originates from the N terminus of 
the p subunit The complete dependence of Taspasel 
upon thr234 for both its intramolecular proteolysis to 
generate an a/p heterodimer as well as its proteolytic 
activity for MLL substrates supports Its role as the active 
nucleophile and the Inclusion of Taspasel In this family. 

Caspases and Granzyme 6 are well-characterized 
proteases that also cleave peptide bonds after a P1 
aspartate (Odake et al., 1991; Thomberry et al., 1992). 
However, they belong to structurally distinct, large fami- 
lies of proteases that use cysteine or serine as the active 
nucleophile. Among the known proteases, the p subunit 
of the 20S proteasome, the p subunit of the archea 
proteasome, and the catalytic subunit of E coll HsIV are 
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known threonine endopeptidases (Bochtler et al. p 1999; 
Seemuller et aL, 1995). WhHe the 20S proteasome may 
be responsible for site-specific cleavage of precursor 
proteins such as p105 NFkB (Palombella et al., 1994), 
It predominantly participates in protein degradation 
(Hochstrasser, 1996). 

MLL family proteins have architecturally positioned 
domains with distinct biochemical properties (Figure 1 A 
and Supplemental Figure S2 available at Cell website). 
Vertebrate MLL has two conserved cleavage sites, CS1 
and CS2 (Hsieh et al. f 2003), Of note, other homologs 
including invertebrate trithorax and vertebrate MLL2 
only possess the more conserved CS2 site. These cleav- 
age sites are strategically positioned such that proteo- 
lyzed MLL fragments form a heterodimeric complex uti- 
lizing the FYRN domain of N320 and the FYRC plus 
SET domains of the C180 fragment (Hsieh et ah, 2003). 
Consequently, the postcleavage reassembly of MLL 
fragments would be predicted to bring four domains 
(i.e., PHD4, FYRN, FYRC, and SET) in close proximity. 
This postcleavage juxtaposition is reminiscent of the 
contiguous four-domain cluster, PHD4-FYRN-FYRC- 
SET, found at the C terminus of more evolutlonarily dis- 
tant MLL homologs (Supplemental Figure S2 available 
at Cell website). This latter set of proteins is present 
in all Metazoa Including T1 2D8.1 of C. efegans and trr 
ifnthorax-retated) of Dmsophila, ALB i£LL1 Related) 
and MLL3 of Homo sapiens (Supplemental Figure S2 
available at Cell website). These proteins with clustered 
domains do not display the cleavage sequence and re- 
combinant MLL3 was not proteolyzed (data not shown). 
The proximity of these four domains may be essential for 
a common biochemical property of this protein family. 
Higher organisms with complex body plans require addi- 
tional levels of regulation for specification programs in- 
cluding those mediated by Hox genes. Chordata and 
Arthrvpoda possess MLL homologs with motifs proteo- 
lyzed by the coevolved Taspasel, which distinguishes 
them from the more ancient four-domain cluster. 

Deregulated expression of selected HOX genes ac- 
companies some human malignancies including leuke- 
mias bearing chromosomal translocations that result in 
MLL-fusion proteins (Armstrong et al. ( 2002; Yeoh et aL, 
2002). Recently, the transformation of myeloid progeni- 
tors mediated by an MLL-ENL fusion protein was shown 
to depend on selected downstream Hox genes (Ayton 
and Cleary, 2003). The discovery of Taspasel provides 
a unique opportunity to probe the role of MLL cleavage 
In both physiologic and pathologic settings. Finally, our 
data warrant development of a Taspasel inhibitor that 
would be predicted to impair expression of selected 
HOX genes and might prove a rational approach to can- 
cer therapy. 

Homeotlc genes of the invertebrates and Hox genes of 
the vertebrates encode a family of related transcription- 
regulatory proteins that play essential roles in imple- 
menting developmental patterns and conferring critical 
positional information (Krumlauf, 1994). The initiation of 
Hox gene expression is controlled by transiently ex- 
pressed segmentation gene products. However, the 
maintenance of appropriate Hox expression patterns is 
conferred by the positive regulatory trithorax group (frx-G) 
and the negative regulatory potycomb group (Pc-G), 
which heavily utilize epigenetic mechanisms (Pirrotta, 
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1998). Mil loss-of-functlon studies Indicate it is required 
for maintaining Hox expression during early embryogen- 
esls (Yu et al., 1995). Moreover, the axial-skeletal trans- 
formations and altered Hox patterns of AW-deficIent 
mice were normalized when PcQ member Bm/-1 was 
deleted (Hanson et al., 1999). The CS2 (D/GVDD) cleav- 
age site in Drosophlla TRX resides within a 281aa region 
deleted In a mutant trx allele, trx* (Mazo et al., 1990). 
Null mutations of trx fall to maintain the expression of 
homeotlc genes of both the blthorax (BX-C) and anten- 
napedla (AJvT-C) complexes (Breen, 1999). Whereas, 
trx* displays selectively reduced expression o1ANT-C 
but not BX-C genes (Sedkov et al., 1994). Subsequent 
studies using antibodies recognizing different regions of 
TRX support the CS2 sequences serving as a proteolytic 
site (Kuzin et al., 1994). Similarly, our RNAI knockdown 
of Taspasel in mammalian cells suggests that cleavage 
of MLL is important In maintaining expression of the 
"early" 3' located HOX A genes (HOX At, A3, A4 analo- 
gous to ANT-C) but not the "late" more 5' located HOX 
A genes (HOX A9, A10 analogous to BX-C) (Figure 7). 
In contrast, RNAI knockdown of MLL and our M//-nulI 
cells display more global defects in Hox gene expression 
similar to trx-null alleles (data not shown). Perhaps the 
earliest expressed 3' HOX genes are most dependent 
upon processed MLL, raising the possibility that the 
precise multiprotein complexes assembled by MLL 
might vary along the HOX gene cluster. A role for Tas- 
pasel -mediated cleavage of MLL in proper HOX gene 
expression suggests Taspasel coevolved with MLUtrl- 
thorax at the time when Arthrvpoda and Chordata 
emerged from Metazoa, enabling the more complex 
axial skeletal and developmental plans of higher or- 
ganisms. 

Experimental Procedures 

Plasmfd Construction and Antibody Production 
PCR fragments consisting of MLL aa 2400-2900 derived from either 
wfld-type or noncleavaWe MIX mt (CS1/CS2 mt) were Inserted Into 
a Myc/Rag doubly tagged eukaryotlc expression vector for transient 
transection assays. These constructs also contain a5'T7 promoter 
for generating in vitro transcription/translation product of "S-methi- 
onlne- labeled p75 MLL substrates. Full-length Taspasel was cloned 
from 293T cell cDNA and inserted Into the Myc/Flag doubly tagged 
expression vector, a protein C-tagged vector, and a His -tagged 
bacteria expression vector, ET1 5b (Novagen). Taspasel mutants 
were generated using QuikChange site-directed mutagenesis kit 
(Stratagene). Rabbit antKTaepasel polyclonal antibody was raised 
against aa 7-212 of purified recombinant human Taspasel . Tran- 
sient transection, in vitro transcription/translation, "S-methlonlne 
labeling, and immunobfot assays were performed as previously de- 
scribed (Hsieh et al., 2003). 

In Vitro Cleavage Assays 

"S-methlonlne-labeled MLL substrate was incubated with 2 u,l of 
indicated subcellular fractions or specified amounts of rTaspasel 
in cleavage buffer (100 mM HEPES [pH 7.9], 5 mM MgCI* 20 mM 
KO, 5 mM DTT, and 10% sucrose) at a final volume of 25 for 1 hr 
at 37°C or Indicated periods of time. Protease inhibitors utilized 
Include 8.5 \lM phosphoramkJon, 100 jiM TLCK, 100 jiM TPCK, 
5 mM todoacetamide, 5 mM N-ethytmaleimlde, 0.3 \lM aprotlnln, 
100 p.M leupeptin, 1 jiM pepstatin, 1 x complete protease Inhibitor 
cocktail (Roche), 100 anti-pain, 100 u.M APMSF, 10 *iM bestatin, 
25 jaM ALLN, 100 u.M chymostatin, 1 0 pJM E-64, 5 mM EDTA, 1 mM 
PMSF, 1 mM EGTA, 50 yJM BAF (Boo-As partyi-FMK), and 50 nM 
z-VAD (z-VAD-FMK). 
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Purification and LC-MS/MS 

Human 293T cells from on© hundred 15 cm dishes were collected 
and Incubated In hypotonic buffer (10 mM HEPES [pH 7.9], 1.5 mM 
MgCI* 10 mM KCl, 0.2 mM PMSF, 1 mM EDTA, 1 mM EDTA, 1 mM 
EGTA, and protease Inhibitor cocktail [Roche]) for 15 mln on Ice. 
Supplemented protease Inhibitors were omitted during the initial 
characterization of MLL cleaving protease. Cells were homogenized 
using a glass dounce and. the homogenized cellular extract was 
subjected to centrifugal fractionation. Nuclei (Nuc) and unbroken 
cells were twice separated at 700 g for 10 mln. The heavy membrane 
(HM) fraction pellet was collected after two centrifugatlons of the 
supernatant at 7000 g for 10 mln. The resulting supernatant was 
centrtfuged at 100,000 g for 30 mfn to yield the light membrane 
pellet (LM) and final soluble fraction (SI 00). Proteins were solubilized 
in buffer A (20 mM HEPES [pH 7.9], 100 mM KCl, 1.5 mM MgClj, 0.2 
mM PMSF, 1 mM DTT, 1 mM EDTA, 1 mM EGTA, 0.1% Tween 20, 
and 10% glyceroO with additional 0.5% of Tween 20. Solubilized 
LM fraction was applied to a Pi 1 column and the bound protease 
was eluted wtth gradients of KCl. Positive fractions were collected 
and cfialyzed against buffer B (10 mM HEPES (pH 7.9], 100 mM KCl, 
1 mM MgCla. 10 uM CaCI* 0.2 mM PMSF, 1 mM DTT, 1 mM EDTA, 

I mM EGTA, 10% glycerol, 0.1% Tween 20, and 10 mM potassium 
phosphate [pH 7.9]) and applied to a hydroxyapatlte column. Elution 
was performed with a phosphate gradient and the protease positive 
fractions were dialyzed against buffer A before loading onto indi- 
cated chromatographic columns. LC-MS/MS was performed by the 
Taplin Biological Mass Spectrometry Facility at the Harvard Medi- 
cal School. 

Recombinant Enzyme and Edman Degradation Analysis 
His-tagged Taspasel was expressed In BL21 (DE3) cells and purified 
with a TALON column (Oontech). N-terminal protein sequencing 
was performed by Molecular Biology Core Facility at the Dana- 
Farber Cancer Institute. 

RNAI, Reverse Transcription, and Quantitative PCR 
HeLa cells were transfectsd with indicated duplex RNAI (Dhar- 
macon) using ollgofectamlne (Invrtrogen). Double-stranded ribooli- 
gonucteo tides with overhanging 3' deoxy TT were prepared to target 
mRNAs of either hTaspasel (GACUCACAUUUCAAGACUU) or hMLL 
(GAAQUCAGAGUGCGAAGUC). Cells harvested 72 hr after transfec- 
tlon were either lysed in RJPA buffer for Immunoblots or with Trizol 
(Invltiogen) for RNA purification using RNeasy (Qiagen). Transfectlon 
of p75 wt MLL reporter substrate was performed 48 hr following 
individual RNAI knockdown In HeLa cells and the cellular extracts 
were harvested 24 hr later for immunoblot analysis. Reverse tran- 
scription were performed with oligo-dT primers using Superscript 

II (Invitrogen). Quantitative PCR was performed In triplicate using 
indicated gene specific primers (Supplemental Data available at Cefl 
website) with SYBR green (PE biosystems) on the ABI Prism 7700 
sequence detection system. 
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